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DETAILED ACTION 

Status 

1 . This instant application having Application No. 10/563,409. 

The preliminary amendment filed on December 30, 2005 is eligible to be entered. 
Claims 1-71 have been preliminarily canceled. 
Claims 72-142 are pending in this Office action. 

Oath/Declaration 

2. The Applicant's oath/declaration has been reviewed by the Examiner and is 
found to conform to the requirements prescribed in 37 C.F.R. 1.63. 

Information Disclosure Statement 

3. Applicant did not submit any Information Disclosure Statement. 

Priority I Filing Date 

4. The Applicant's claim for priority of PCT Application No. PCT/EP04/051 1 90 (filed 
on June 22, 2004), and .European Application No. EP 0342536.7 (filed on July 1, 2003) 
is confirmed. The Examiner takes the earliest foreign filing date of July 1 , 2003 into 
consideration. 

Abstract 

5. The abstract of the disclosure does not commence on a separate sheet in 
accordance with 37 CFR 1 .52(b)(4). A new abstract of the disclosure is required and 
must be presented on a separate sheet, apart from any other text. 
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Drawings 

6. The drawings were received on December 30, 2005. These drawings are 
acceptable for examination purposes. 

Claim Objections 

7. Claims 82, 99, 108, and 118 are objected because of informalities presented 
within the claims. 

Claim 82 contains a period (.) to indicate the end of a limitation on line 13. 

Claim 118 contains either a period (.) or nothing to indicate the end of limitations 
on lines 6, and 9-10. 

Accordingly, except the last limitation which may use a period (.) at the end, only 
colon (,) or semicolon (;) may be used to indicate the end of a limitation. Appropriate 
correction is required. 

Claim 99 contains a grammatical error "form" instead of "from" on line 9. 

Claim 108 contains a grammatical error "form" instead of "from" on line 9. 
Claim Rejections - 35 USC § 101 

8. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

9. Claims 72-116 are rejected under 35 U.S.C. 101 because the claimed invention 
is directed to non-statutory subject matters. 

Regarding claims 72, 82, 99, and 108, each of the claim is not limited to 
statutory subject matter since the claims each merely recites an abstract idea with no 
physical transformation outside the computer to produce a practical result. Each of the 
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claim is no more than a preemptive idea of a motive to mathematically resolve the issue 
of projecting of data objects from a higher dimension to a lesser dimension. 

A principle, in the abstract, is a fundamental truth; an original cause; a motive; 
these cannot be patented, as no one can claim in either of them an exclusive right. As 
such, the claims are each nonstatutory. See Diehr, 450 U.S. at 186 and Gottschalk v. 
Benson, 409 U.S. 63, 71-72 (1972); Le Roy v. Tatham, 55 U.S. 156, 175 (1852); Funk 
Brothers Seed Co. v. Kalo Inoculant Co., 333 U.S. 127,132, 76 USPQ 280, 282 (1948). 

Claims 73-81, 83-98, 100-107, and 109-116 are also rejected under 35 U.S.C. 
101 because of their dependencies on claims 72, 82, 99, and 108 respectively. 
Claim Rejections - 35 USC §112 

1 0. The following is a quotation of the second paragraph of 35 U.S.C. 1 12: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

11. Claims 76, 107, 112-113, and 116 are rejected under 35 U.S.C. 112, second 
paragraph, as being indefinite for failing to particularly point out and distinctly claim the 
subject matter which applicant regards as the invention. 

Regarding claim 76, the abbreviation 'GenD' is not clearly defined in the claimed 
language, thus produces doubts to the readers regarding its particular meaning in the 
art. Appropriate correction is required. 

Regarding claim 107, the claim refers to " the structure of a molecule" in claim 82 
whilst claim 82 does not recite any antecedent basis for a molecule's structure. 
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Regarding claim 112, the claim refers to "the database of geographical sites" in 
claim 108 whilst claim 108 does not recite any antecedent basis for a database of 
geographical sites. 

Claims 113, and 116 are also rejected under 35 U.S.C. 112, 2 nd paragraph for 
their dependencies on claim 112. 

Claim Rejections - 35 USC § 103 

12. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

13. Claims 72-93, 96-97, and 117-142 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Aqrafiotis et al. (Pub. No. US 2002/0091655, published on July 11, 
2002; hereinafter Aqrafiotis ) in view of Stolte etal. (Pub. No. US 2004/0243593, filed on 
June 2, 2003; hereinafter Stolte ). 

Regarding claim 72, Aqrafiotis clearly shows and discloses a method for 
projecting information data from a multidimensional space onto a space having lesser 
dimensions ([0026]), comprising: 

providing a database of N-dimensional data in the form of records having a 
certain number of variables (a set of input patterns of dimensionality n, each with 
corresponding coordinates, [0090]); 

defining a metric function for calculating a distance between each record in the 
database (distant function d(), [0094]); 
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calculating a matrix of distances between each record in the database using said 
metric function (training set T is extracted from the set of input patterns n, the difference 
in distance between a random set of points X\. k and the set of reference points (both of 
which were extracted from the input space R" is calculated, [0096]-[0097]); 

defining a N-1 dimensional space in which each record is defined by N-1 
coordinates (for a linear from n to m dimensions, a simple 3-layer network with n input 
and m output units can be employed. The network is trained to reproduce the input- 
output coordinates produced by the iterative algorithm, and thus encodes the mapping 
in its synaptic parameters in a compact, analytical manner, [0090]); 

calculating the N-1 coordinates of each record in the N-1 dimensional space 
using an evolutionary algorithm (coordinates of a plurality of objects on the m- 
dimensional nonlinear map are determined by the algorithm described in [0050]-[0053]); 

defining a best projection of the records onto the N-1 dimensional space as a 
projection in which a distance matrix of the records in the N-1 dimensional space one of 
best fits or has minimum differences with the distance matrix of the records calculated in 
the N-dimensional space (Note that new patterns in R" that not in the original input set 
can also be projected into RT in the manner shown in Figure 6. Once the system is 
trained, new patterns in R" are mapped by identifying the nearest local network and 
using that network in a feed-forward manner to perform the projection. The input for the 
system is a pattern 705 in R". This point is defined by its n attributes, x is x 2 , . . . x„). The 
system includes a dispatcher module 710, which compares the distance of the input 
point to the network centers (i.e., the reference points), and forwards the input point to 
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one of the available local neural networks 701, 702, or 703. Specifically, the input 
pattern is sent to the local neural network associated with the reference point nearest to 
the input pattern. The chosen network then performs the final projection, resulting in an 
output point in FT, [0110]). 

Agrafiotis does not explicitly disclose the higher dimension is N while the lesser 
dimension N-1. 

However, Stolte discloses the higher dimension is N while the lesser dimension 
N-1 (Projection (Time, Product, Location -> Time, Location) reduces the dimensionality 
of an n-dimensional data cube to (n-1) by aggregating across a dimension. For 
example, in Figure 15, the first projection summarizes across "Location", reducing the 3- 
dimensional cube to a 2-dimensional cube, [0189]). 

It would have been obvious to an ordinary person skilled in the art at the time of 
the invention was made to incorporate the teachings of Stolte with the teachings of 
Agrafiotis for the purpose of providing an interactive visual exploration tool that 
facilitates exploratory analysis of databases having a hierarchical structure ([0038] of 
Stolte ). 

Regarding claim 73, Agrafiotis further discloses the database of N-dimensional 
data already contains data as to distances between the records (Figure 1 illustrates 
possibilities for a single hypothetical pairwise relationship and distances of 
corresponding objects on a nonlinear map, [0028]). 

Regarding claim 74, Agrafiotis further discloses said evolutionary algorithm is a 
genetic algorithm (The approach employs an iterative algorithm based on subset 
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refinements to nonlinearly map a small random sample which reflects the overall 
structure of the data, and then "learns" the underlying nonlinear transform using a set of 
distributed neural networks, each specializing in a particular domain of the feature 
space, [0045]). 

Regarding claim 75, Agrafiotis further discloses encoding each individual record 
or variable as a point having coordinates X and Y (reducing the dimensionality of high- 
dimensional data in a way that preserves the original relationships of the data objects, 
and 2) producing Cartesian coordinate vectors from data supplied directly in the form of 
similarities or proximities, [0012]); 

defining a set of different X and Y coordinates for each point forming a first 
population of projections solution onto the lesser dimensional space {for a nonlinear 
projection from n to m dimensions, a standard 3-layer neural network with n input and m 
output units is used. Each n-dimensional object is presented to the input layer, and its 
coordinates on the m-dimensional nonlinear map are obtained by the respective units in 
the output layer, [0021]); 

calculating a fitness score for each of the projections of the first population by 
using as a fitness function the matrix of distances of the single points in the original N- 
dimensional space {pairwise relationship and the current distance of the corresponding 
objects on the nonlinear map are illustrated in Figure 1, where shaded areas 110, 112 
and 114 denote allowed ranges for a given pairwise relationship, [0075]); 
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subjecting the population of projections to combination according to certain 
combinatorial rules to produce a first generation population of projections which 
comprises X and Y coordinates for the points which are a combination of the 
coordinates provided in two projections of the parent generation {Note that new patterns 
in R" that not in the original input set can also be projected into Ff in the manner shown 
in Figure 6. Once the system is trained, new patterns in R" are mapped by identifying 
the nearest local network and using that network in a feed-forward manner to perform 
the projection. The input for the system is a pattern 705 in Ff. This point is defined by its 
n attributes, x is x 2 , . . . x n ). The system includes a dispatcher module 710, which 
compares the distance of the input point to the network centers (i.e., the reference 
points), and forwards the input point to one of the available local neural networks 701, 
702, or 703. Specifically, the input pattern is sent to the local neural network associated 
with the reference point nearest to the input pattern. The chosen network then performs 
the final projection, resulting in an output point in FT, [01 10]); and 

calculating the fitness score of the projections of the first generation and forming 
a new generation basing on said first generation (The points (objects) in each cell of the 
grid are then subjected to a phase of local refinement (i.e. only objects from within the 
same cell are compared and refined). Preferably, the number of sampling steps in each 
cell should be proportional to the number of objects contained in that cell. This process 
is highly parallelizable. This local refinement phase is then followed by another global 
refinement phase, and the process is repeated for a prescribed number of cycles, or 
until the embedding error is minimized within a prescribed tolerance, [0086]). 
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Regarding claim 76, Agrafiotis further discloses wherein the genetic algorithm is 
the GenD algorithm (employ Sammon's nonlinear mapping algorithm to project a small 
random sample of objects from a given population, and then "learn" the underlying 
nonlinear transform using a multilayer neural network trained with the standard error 
back-propagation algorithm or some other equivalent technique, [0021]). 

Regarding claim 77, Agrafiotis further discloses a hidden point is defined which 
corresponds to a hidden record or a to a hidden variable whose existence is only 
guessed at, said hidden point being added to the parent population by giving it position 
coordinates X hi and Y hi in the projection (multi-layer feed-forward networks trained to 
reproduce their inputs as desired outputs. They consist of an input and an output layer 
containing as many neurons as the number of input dimensions, and a series of hidden 
layers having a smaller number of units. In the first part of the network, each sample is 
reorganized, mixed, and compressed into a compact representation encoded by the 
middle layer. This representation is then decompressed by the second part of the 
network to reproduce the original input, [001 1]). 

Regarding claim 78, Agrafiotis further discloses the calculation of the 
evolutionary algorithm is carried out in parallel with and without the hidden point and 
wherein best fit projections obtained from said two parallel calculations are compared (a 
phase of global refinement is initially carried out, after which, the resulting nonlinear 
map is partitioned into a regular grid. The points (objects) in each cell of the grid are 
then subjected to a phase of local refinement (i.e. only objects from within the same cell 
are compared and refined). Preferably, the number of sampling steps in each cell 
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should be proportional to the number of objects contained in that cell. This process is 
highly parallelizable. This local refinement phase is then followed by another global 
refinement phase, and the process is repeated for a prescribed number of cycles, or 
until the embedding error is minimized within a prescribed tolerance, [0086]). 

Regarding claim 79, Aqrafiotis further discloses: 

providing a database comprising at least two records, each record having a 
certain number of variables (a set of input patterns of dimensionality n, each with 
corresponding coordinates, [0090]); 

elaborating the database alternatively or in parallel: in a first manner in which the 
records are considered as being points and the variables as the coordinates of the 
points; and in a second manner in which the variables are considered as being points 
and the records as the coordinates of the points (training set T is extracted from the set 
of input patterns n, the difference in distance between a random set of points X hk and 
the set of reference points (both of which were extracted from the input space R" is 
calculated, [0096]-[0097]). 

Regarding claim 80, Aqrafiotis further discloses operating on the database in a 
pre-processing or post-processing phase {The method uses a self-organizing principle 
to iteratively refine an initial (random or partially ordered) configuration of objects by 
analyzing only a subset of objects and their associated relationships at a time, [0047]). 

Regarding claim 81, Aqrafiotis further discloses the database is processed in a 
preventive stage by means of a Self Organising Map algorithm (The method uses a self- 
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organizing principle to iteratively refine an initial (random or partially ordered) 
configuration of objects by analyzing only a subset of objects and their associated 
relationships at a time, [0047]), and wherein the clusters formed by this algorithm in the 
different units are projected to said lesser dimensional space {the reference points c, are 
determined using a clustering algorithm described in greater detail below. In step 325, 
the training set T is partitioned into c disjoint clusters based on the distance of each 
point Xj from each reference point. The training phase concludes with step 335, [0097]). 

Regarding claim 82, Aqrafiotis clearly shows and discloses a method for the 
cognitive analysis of multidimensional information data ([0026]), comprising: 

providing a database with a certain number of records, each record comprising a 
certain number of variables and being relative to a N-dimensional space (a set of input 
patterns of dimensionality n, each with corresponding coordinates, [0090]); 

projecting the database onto a space having a lesser number of dimensions 
relative to the N-dimensional space, considering the records as points and the variables 
as coordinates, or the variables as points and the records as coordinates {For a 
nonlinear projection from n to m dimensions, a simple 3-layer network with n input and 
m output units can be employed. The network is trained to reproduce the input/output 
coordinates produced by the iterative algorithm, and thus encodes the mapping in its 
synaptic parameters in a compact, analytical manner, [0090]); 

carrying out said projection using an algorithm for projecting information data 
belonging to a multidimensional space into a space having lesser dimensions by {The 
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process described herein uses the iterative nonlinear mapping algorithm described in 
Section II to multidimensionally scale a small random sample of a set of input patterns 
of dimensionality n, and then "learns" the underlying nonlinear transform using an 
artificial neural network, [0090]): 

calculating a matrix of distances between each point defined by a record or a 
variable of the database using a metric function (training set T is extracted from the set 
of input patterns n, the difference in distance between a random set of points X,.k and 
the set of reference points (both of which were extracted from the input space R" is 
calculated, [0096]-[0097]); 

defining a N-1 dimensional space in which each point represented by a record or 
a variable is defined by N-1 coordinates (for a linear from n to m dimensions, a simple 
3-layer network with n input and m output units can be employed. The network is trained 
to reproduce the input-output coordinates produced by the iterative algorithm, and thus 
encodes the mapping in its synaptic parameters in a compact, analytical manner, 
[0090]); 

calculating the N-1 coordinates of each point in the N-1 dimensional space using 
an evolutionary algorithm (coordinates of a plurality of objects on the m-dimensional 
nonlinear map are determined by the algorithm described in [0050]-[0053]); and 

defining the projection in which the distance matrix of the points in the N-1 
dimensional space best fits or has minimum differences with the distance matrix of the 
points calculated in the N-dimensional space as the best projection of said points onto 
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the N-1 dimensional space (Note that new patterns in R" that not in the original input set 
can also be projected into Ff in the manner shown in Figure 6. Once the system is 
trained, new patterns in Ff are mapped by identifying the nearest local network and 
using that network in a feed-forward manner to perform the projection. The input for the 
system is a pattern 705 in R". This point is defined by its n attributes, x is x 2 , . . . x n ). The 
system includes a dispatcher module 710, which compares the distance of the input 
point to the network centers (i.e., the reference points), and forwards the input point to 
one of the available local neural networks 701, 702, or 703. Specifically, the input 
pattern is sent to the local neural network associated with the reference point nearest to 
the input pattern. The chosen network then performs the final projection, resulting in an 
output point in FT, [0110]). 

Aqrafiotis does not explicitly disclose the higher dimension is N while the lesser 
dimension N-1 . 

However, Stolte discloses the higher dimension is N while the lesser dimension 
N-1 (Projection (Time, Product, Location -> Time, Location) reduces the dimensionality 
of an n-dimensional data cube to (n-1) by aggregating across a dimension. For 
example, in Figure 15, the first projection summarizes across "Location", reducing the 3- 
dimensional cube to a 2-dimensional cube, [01 89]). 

It would have been obvious to an ordinary person skilled in the art at the time of 
the invention was made to incorporate the teachings of Stolte with the teachings of 
Aqrafiotis for the purpose of providing an interactive visual exploration tool that 
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facilitates exploratory analysis of databases having a hierarchical structure ([0038] of 
Stoke). 

Regarding claim 83, Aqrafiotis further discloses the database already contains 
distances between the records (Figure 1 illustrates possibilities for a single hypothetical 
pairwise relationship and distances of corresponding objects on a nonlinear map, 
[0028]). 

Regarding claim 84, Aqrafiotis further discloses the evolutionary algorithm is a 
genetic algorithm (The approach employs an iterative algorithm based on subset 
refinements to nonlinearly map a small random sample which reflects the overall 
structure of the data, and then "learns" the underlying nonlinear transform using a set of 
distributed neural networks, each specializing in a particular domain of the feature 
space, [0045]). 

Regarding claim 85, Aqrafiotis further discloses said analysis: 

encodes each individual record or variable as a point having coordinates X and Y 
(reducing the dimensionality of high-dimensional data in a way that preserves the 
original relationships of the data objects, and 2) producing Cartesian coordinate vectors 
from data supplied directly in the form of similarities or proximities, [0012]); 

defines a set of different X and Y coordinates for each point forming a first 
population of projections solution onto the less dimensional space, usually a two or 
three dimensional space (for a nonlinear projection from n to m dimensions, a standard 
3-layer neural network with n input and m output units is used. Each n-dimensional 
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object is presented to the input layer, and its coordinates on the m-dimensional 
nonlinear map are obtained by the respective units in the output layer, [0021]); 

calculates a fitness score for each of the projections of this first population by 
using the matrix of distances of the single points in the original N-dimensional space as 
a fitness function (pairwise relationship and the current distance of the corresponding 
objects on the nonlinear map are illustrated in Figure 1, where shaded areas 110, 112 
and 114 denote allowed ranges for a given pairwise relationship, [0075]); 

combines the population of projections according to certain combinatorial rules to 
produce a first generation population of projections which comprises X and Y 
coordinates for the points which are a combination of the coordinates provided in two 
projections of the parent generation (Note that new patterns in Ff that not in the original 
input set can also be projected into Ff in the manner shown in Figure 6. Once the 
system is trained, new patterns in Ff are mapped by identifying the nearest local 
network and using that network in a feed-forward manner to perform the projection. The 
input for the system is a pattern 705 in Ff. This point is defined by its n attributes, x is 
x 2 , . . . x n ). The system includes a dispatcher module 710, which compares the distance 
of the input point to the network centers (i.e., the reference points), and forwards the 
input point to one of the available local neural networks 701, 702, or 703. Specifically, 
the input pattern is sent to the local neural network associated with the reference point 
nearest to the input pattern. The chosen network then performs the final projection, 
resulting in an output point in R m , [0110]); 



Application/Control Number: 10/563,409 Page 17 

Art Unit: 2165 

calculates a fitness score of the projections of the first generation; and forms a 
new generation based on the first generation (The points (objects) in each cell of the 
grid are then subjected to a phase of local refinement (i.e. only objects from within the 
same cell are compared and refined). Preferably, the number of sampling steps in each 
cell should be proportional to the number of objects contained in that cell. This process 
is highly parallelizable. This local refinement phase is then followed by another global 
refinement phase, and the process is repeated for a prescribed number of cycles, or 
until the embedding error is minimized within a prescribed tolerance, [0086]). 

Regarding claim 86, Aqrafiotis further discloses the evolutionary algorithm is the 
GenD algorithm (employ Sammon's nonlinear mapping algorithm to project a small 
random sample of objects from a given population, and then "learn" the underlying 
nonlinear transform using a multilayer neural network trained with the standard error 
back-propagation algorithm or some other equivalent technique, [0021]). 

Regarding claim 87, Aqrafiotis further discloses a hidden point represented by a 
hidden record or a hidden variable can be defined, which corresponds to a hidden point 
on the map and whose existence is only guessed at, said hidden point being added to 
the parent population by giving it position coordinates X hi and Y hi in the projection (multi- 
layer feed-forward networks trained to reproduce their inputs as desired outputs. They 
consist of an input and an output layer containing as many neurons as the number of 
input dimensions, and a series of hidden layers having a smaller number of units. In the 
first part of the network, each sample is reorganized, mixed, and compressed into a 
compact representation encoded by the middle layer. This representation is then 
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decompressed by the second part of the network to reproduce the original input, 
[0011]). 

Regarding claim 88, Aqrafiotis further discloses the calculation of the 
evolutionary algorithm is carried out in parallel, with and without the hidden point, and 
best fit projections obtained by said two parallel calculations are compared (a phase of 
global refinement is initially carried out, after which, the resulting nonlinear map is 
partitioned into a regular grid. The points (objects) in each cell of the grid are then 
subjected to a phase of local refinement (i.e. only objects from within the same cell are 
compared and refined). Preferably, the number of sampling steps in each cell should be 
proportional to the number of objects contained in that cell. This process is highly 
parallelizable. This local refinement phase is then followed by another global refinement 
phase, and the process is repeated for a prescribed number of cycles, or until the 
embedding error is minimized within a prescribed tolerance, [0086]). 

Regarding claim 89, Aqrafiotis further discloses: 

providing a database comprising a certain number of records each one having a 
certain number of variables (a set of input patterns of dimensionality n, each with 
corresponding coordinates, [0090]); 

elaborating the database alternatively or in parallel according to: a first manner 
by which the records are considered as being points and the variables as being the 
coordinates of the points; and a second manner by which the variables are considered 
as being points and the records are the coordinates (training set T is extracted from the 
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set of input patterns n, the difference in distance between a random set of points X\. k 
and the set of reference points (both of which were extracted from the input space R" is 
calculated, [0096]-[0097]). 

Regarding claim 90. Aqrafiotis further discloses using a different algorithm to 
operate on the database in a pre-processing or a post-processing phase {The method 
uses a self-organizing principle to iteratively refine an initial (random or partially 
ordered) configuration of objects by analyzing only a subset of objects and their 
associated relationships at a time, [0047]). 

Regarding claim 91 , Aqrafiotis further discloses the database is processed in a 
preventive stage by means of a Self Organising Map algorithm (The method uses a self- 
organizing principle to iteratively refine an initial (random or partially ordered) 
configuration of objects by analyzing only a subset of objects and their associated 
relationships at a time, [0047]), and wherein the clusters formed by said SOP algorithm 
in the different units are projected onto said lesser dimensional space {the reference 
points Q are determined using a clustering algorithm described in greater detail below. 
In step 325, the training set T is partitioned into c disjoint clusters based on the distance 
of each point x, from each reference point. The training phase concludes with step 335, 
[0097]). 

Regarding claim 92, Aqrafiotis further discloses the distance of the points on the 
map onto which the database has been projected is used as a measure of similarity of 
the records or variables related to said points {The general algorithm can also be 
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applied when the pairwise similarity matrix is incomplete, i.e. when some of the pairwise 
similarities are unknown, when some of the pairwise similarities are uncertain or corrupt, 
or both of the above, [0066]). 

Regarding claim 93, Aqrafiotis further discloses: 

a database is provided comprising a certain number of records, each record 
related to a certain number of variables (a set of input patterns of dimensionality n, each 
with corresponding coordinates, [0090]); 

complementary variables to the variables originally provided are added to the 
database (the basic algorithm does not distinguish weak from strong relationships 
(short-range and long-range distances, respectively). One method to ensure that strong 
relationships are preserved more faithfully than weak relationships is to weight the 
coordinate update in Equation 5 (or, equivalently, the learning rate A in Equations 7 and 
8) by a scaling factor that is inversely proportional to the strength (magnitude) of the 
relationship, [0085]); 

the integrated database is projected onto a lesser dimensional space, particularly 
on a two or three dimensional space (Note that new patterns in Ff that not in the 
original input set can also be projected into Ff 1 in the manner shown in Figure 6. Once 
the system is trained, new patterns in Ft" are mapped by identifying the nearest local 
network and using that network in a feed-forward manner to perform the projection. The 
input for the system is a pattern 705 in Ff. This point is defined by its n attributes, x is 
x 2 , . . . x n ). The system includes a dispatcher module 710, which compares the distance 
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of the input point to the network centers (i.e., the reference points), and forwards the 
input point to one of the available local neural networks 701, 702, or 703. Specifically, 
the input pattern is sent to the local neural network associated with the reference point 
nearest to the input pattern. The chosen network then performs the final projection, 
resulting in an output point in R m , [01 1 0]); and 

the distance between each variable and its complementary variable in said 
mapping is used as a measure of the relevance of said variable in the database 
(training set T is extracted from the set of input patterns n, the difference in distance 
between a random set of points X\. k and the set of reference points (both of which were 
extracted from the input space R" is calculated, [0096]-[0097]). 

Regarding claim 96, Aqrafiotis further discloses implemented as a set of 
computer executable instructions saved on a removable computer readable storage 
medium (removable storage unit storing computer software, [0120]). 

Regarding claim 97, Aqrafiotis further discloses implemented as a set of 
computer executable instructions saved on a removable computer readable storage 
medium (removable storage unit storing computer software, [0120]). 

Regarding claim 117, Aqrafiotis further discloses an apparatus having artificial 
intelligence provided with a processing unit, said processing unit connected to a data 
memory and to a program memory ([0120]), wherein: 

the processing unit is further connected to one or more different sensors for 
detecting or measuring different physical and/or chemical conditions or effects or 
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processes characterizing or occurring in an environment (Figure 11 shows the computer 
system 1000 includes one or more processors, such as processor 1 1 04. The processor 
1104 is connected to a communication infrastructure 1106 (e.g., a bus or network), 
[01 1 9]. See further [001 1 ] for neurons acting as sensors); 

the processing unit is further connected to data input means by means of a 
service person or a data input line from other data collecting apparati; the processing 
unit is also connected to means for carrying out mechanical, physical or chemical 
actions, such as actuators or the like (Secondary memory 1110 can also include other 
similar means for allowing computer programs or input data to be loaded into computer 
system 1100. Such means may include, for example, a removable storage unit 1122 
and an interface 1 120. Examples of such may include a program cartridge and cartridge 
interface (such as that found in video game devices), a removable memory chip (such 
as an EPROM, or PROM) and associated socket, and other removable storage units 
1 122 and interfaces 1 120 which allow software and data to be transferred from the 
removable storage unit 1122 to computer system 1100, [0121]); 

in the program memory a program executable by the processing unit being 
loaded which program has a routine for driving the sensors and saving in a uniquely 
recognizable way each datum collected by the sensors and/or for saving data input by a 
service person or by other apparati, driver for activating or deactivating the means for 
carrying out mechanical, physical or chemical actions, such as actuators or the like 
(Computer programs (also called computer control logic) are stored in main memory 
1108 and/or secondary memory 1110. Computer programs may also be received via 
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communications interface 1124. Such computer programs, when executed, enable the 
computer system 11 00 to perform the features of the present invention as discussed 
herein. In particular, the computer programs, when executed, enable the processor 
11 04 to perform the features of the present invention. Accordingly, such computer 
programs represent controllers of the computer system 1100, [0124]); 

the program stored in the program memory further comprising means for 
evaluating the data collected by the sensors and/or the data inputted by a service 
person or by other apparati (Computer system 1100 may also include a 
communications interface 1124. Communications interface 1124 allows software and 
data to be transferred between computer system 1100 and external devices, 0122]); 

wherein the program includes a subroutine for executing an algorithm 
implementing the method of claim 72 on the collected and/or inputted data {mapping of 
massive data sets from a higher dimensional space to a lower dimensional space. 
Moreover, the method allows the mapping of new input patterns as they become 
available, without the need to reconstruct an entire map, [0026]). 

Regarding claim 118, Agrafiotis further discloses the program is provided with a 
subroutine for carrying out a cognitive analysis of multidimensional data by treating the 
said data with a method for projecting information data belonging to a multidimensional 
space into a space having less dimensions ([0026]) comprising the following steps: 
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providing a database of N-dimensional data in the form of records having a 
certain number of variables (a set of input patterns of dimensionality n, each with 
corresponding coordinates, [0090]); 

defining a metric function for calculating a distance between each record of the 
database {distant function d(), [0094]); 

calculating a matrix of distances between each record of the database by means 
of the metric function defined at the previous step (training set T is extracted from the 
set of input patterns n, the difference in distance between a random set of points X hk 
and the set of reference points (both of which were extracted from the input space R" is 
calculated, [0096]-[0097]); 

defining a N-1 dimensional space in which each record is defined by N-1 
coordinates (for a linear from n to m dimensions, a simple 3-layer network with n input 
and m output units can be employed. The network is trained to reproduce the input- 
output coordinates produced by the iterative algorithm, and thus encodes the mapping 
in its synaptic parameters in a compact, analytical manner, [0090]); 

calculating the N-1 coordinates of each record in the N-1 dimensional space by 
means of an evolutionary algorithm (coordinates of a plurality of objects on the m- 
dimensional nonlinear map are determined by the algorithm described in [0050]-[0053]); 

defining as the best projection of the records onto the N-1 dimensional space the 
projection in which the distance matrix of the records in the N-1 dimensional space best 
fits or has minimum differences with the distance matrix of the records calculated in the 
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n-dimensional space (Note that new patterns in R" that not in the original input set can 
also be projected into R™ in the manner shown in Figure 6. Once the system is trained, 
new patterns in R" are mapped by identifying the nearest local network and using that 
network in a feed-forward manner to perform the projection. The input for the system is 
a pattern 705 in R". This point is defined by its n attributes, x is x 2 , . . . x n ). The system 
includes a dispatcher module 710, which compares the distance of the input point to the 
network centers (i.e., the reference points), and forwards the input point to one of the 
available local neural networks 701, 702, or 703. Specifically, the input pattern is sent to 
the local neural network associated with the reference point nearest to the input pattern. 
The chosen network then performs the final projection, resulting in an output point in 
FT, [0110]). 

Aqrafiotis does not explicitly disclose the higher dimension is N while the lesser 
dimension N-1. 

However, Stolte discloses the higher dimension is N while the lesser dimension 
N-1 (Projection (Time, Product, Location -> Time, Location) reduces the dimensionality 
of an n-dimensional data cube to (n-1) by aggregating across a dimension. For 
example, in Figure 15, the first projection summarizes across "Location", reducing the 3- 
dimensional cube to a 2-dimensional cube, [0189]). 

It would have been obvious to an ordinary person skilled in the art at the time of 
the invention was made to incorporate the teachings of Stolte with the teachings of 
Aqrafiotis for the purpose of providing an interactive visual exploration tool that 
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facilitates exploratory analysis of databases having a hierarchical structure ([0038] of 
Stoke). 

Regarding claim 119, Aqrafiotis further discloses said database already contains 
the distances between the records (Figure 1 illustrates possibilities for a single 
hypothetical pairwise relationship and distances of corresponding objects on a nonlinear 
map, [0028]). 

Regarding claim 120, Aqrafiotis further discloses said evolutionary algorithm is a 
genetic algorithm (The approach employs an iterative algorithm based on subset 
refinements to nonlinearly map a small random sample which reflects the overall 
structure of the data, and then "learns" the underlying nonlinear transform using a set of 
distributed neural networks, each specializing in a particular domain of the feature 
space, [0045]). 

Regarding claim 121 , Aqrafiotis further discloses said subroutine provides for: 

encoding each individual record or variable represented by a point having 
coordinate X and Y (reducing the dimensionality of high-dimensional data in a way that 
preserves the original relationships of the data objects, and 2) producing Cartesian 
coordinate vectors from data supplied directly in the form of similarities or proximities, 
[0012]); 

defining a set of different X and Y coordinates for each point forming a first 
population of projections solution onto the less dimensional space, usually a two or 
three dimensional space (for a nonlinear projection from n to m dimensions, a standard 
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3-layer neural network with n input and m output units is used. Each n-dimensional 
object is presented to the input layer, and its coordinates on the m-dimensional 
nonlinear map are obtained by the respective units in the output layer, [0021]); 

calculating the fitness score for each of the projections of this first population by 
using as the fitness function the matrix of distances of the single points in the originally 
N dimensional space (pairwise relationship and the current distance of the 
corresponding objects on the nonlinear map are illustrated in Figure 1, where shaded 
areas 1 10, 1 12 and 1 14 denote allowed ranges for a given pairwise relationship, 
[0075]); 

subjecting the population of projections to combination according to certain 
combination rules thus producing a first generation population of projections which 
comprises X and Y coordinates for the points which are a combination of the 
coordinates provided in two projections of the parent generation {Note that new patterns 
in Ff that not in the original input set can also be projected into R™ in the manner shown 
in Figure 6. Once the system is trained, new patterns in Ff are mapped by identifying 
the nearest local network and using that network in a feed-forward manner to perform 
the projection. The input for the system is a pattern 705 in Ff. This point is defined by its 
n attributes, x is x 2 , . . . x n ). The system includes a dispatcher module 710, which 
compares the distance of the input point to the network centers (i.e., the reference 
points), and forwards the input point to one of the available local neural networks 701, 
702, or 703. Specifically, the input pattern is sent to the local neural network associated 
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with the reference point nearest to the input pattern. The chosen network then performs 
the final projection, resulting in an output point in R"\ [01 10]); and 

calculating the fitness score of the projections of the first generation and forming 
again a new generation basing on the first generation {The points (objects) in each cell 
of the grid are then subjected to a phase of local refinement (i.e. only objects from within 
the same cell are compared and refined). Preferably, the number of sampling steps in 
each cell should be proportional to the number of objects contained in that cell. This 
process is highly parallelizable. This local refinement phase is then followed by another 
global refinement phase, and the process is repeated for a prescribed number of cycles, 
or until the embedding error is minimized within a prescribed tolerance, [0086]). 

Regarding claim 122. Aqrafiotis further discloses the genetic algorithm is the 
GenD algorithm (employ Sammon's nonlinear mapping algorithm to project a small 
random sample of objects from a given population, and then "learn" the underlying 
nonlinear transform using a multilayer neural network trained with the standard error 
back-propagation algorithm or some other equivalent technique, [0021]). 

Regarding claim 123, Aqrafiotis further discloses a hidden point can be defined 
which corresponds to a hidden record or a to a hidden variable whose existence is only 
speculative, and wherein said hidden point is added in the parent population by giving it 
positional coordinates X hi and Y hi in the projection {multi-layer feed-forward networks 
trained to reproduce their inputs as desired outputs. They consist of an input and an 
output layer containing as many neurons as the number of input dimensions, and a 
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series of hidden layers having a smaller number of units. In the first part of the network, 
each sample is reorganized, mixed, and compressed into a compact representation 
encoded by the middle layer. This representation is then decompressed by the second 
part of the network to reproduce the original input, [001 1]). 

Regarding claim 124, Agrafiotis further discloses the calculation of the 
evolutionary algorithm is carried out in parallel with the hidden point and without the 
hidden point and the best fit projections obtained by the two parallel calculations are 
compared (a phase of global refinement is initially carried out, after which, the resulting 
nonlinear map is partitioned into a regular grid. The points (objects) in each cell of the 
grid are then subjected to a phase of local refinement (i.e. only objects from within the 
same cell are compared and refined). Preferably, the number of sampling steps in each 
cell should be proportional to the number of objects contained in that cell. This process 
is highly parallelizable. This local refinement phase is then followed by another global 
refinement phase, and the process is repeated for a prescribed number of cycles, or 
until the embedding error is minimized within a prescribed tolerance, [0086]). 

Regarding claim 125, Agrafiotis further discloses: 

providing a database comprising a certain number of records each one 
characterized by a certain number of variables (a set of input patterns of dimensionality 
n, each with corresponding coordinates, [0090]); 

elaborating the database alternatively or in parallel according to: a first manner in 
which the records are considered as being points and the variables as being the 
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coordinates of the points; and a second manner in which the variables are considered 
as being points and the records are the coordinates {training set Tis extracted from the 
set of input patterns n, the difference in distance between a random set of points X\. k 
and the set of reference points (both of which were extracted from the input space R" is 
calculated, [0096]-[0097]). 

Regarding claim 126, Aqrafiotis further discloses a different treatment of the data 
as a pre or post processing phase (The method uses a self-organizing principle to 
iteratively refine an initial (random or partially ordered) configuration of objects by 
analyzing only a subset of objects and their associated relationships at a time, [0047]). 

Regarding claim 127, Aqrafiotis further discloses characterized in that the data is 
processed in a preventive step by means of a Self Organizing Map algorithm (The 
method uses a self-organizing principle to iteratively refine an initial (random or partially 
ordered) configuration of objects by analyzing only a subset of objects and their 
associated relationships at a time, [0047]), the clusters formed by said SOM algorithm in 
the different units then being projected onto said space of lesser dimension (the 
reference points c, are determined using a clustering algorithm described in greater 
detail below. In step 325, the training set T is partitioned into c disjoint clusters based on 
the distance of each point x, from each reference point. The training phase concludes 
with step 335, [0097]). 

Regarding claim 128, Aqrafiotis further discloses a relationship between the 
collected data records of the collected data is determined by means of the distance of 
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each data record from the other data records and wherein said distance is used as a 
relevance weight for each data record in determining the activation or deactivation of 
one or more of the means for carrying out mechanical, physical and/or chemical actions 
(the basic algorithm does not distinguish weak from strong relationships (short-range 
and long-range distances, respectively). One method to ensure that strong relationships 
are preserved more faithfully than weak relationships is to weight the coordinate update 
in Equation 5 (or, equivalently, the learning rate A in Equations 7 and 8) by a scaling 
factor that is inversely proportional to the strength (magnitude) of the relationship, 
[0085]). 

Regarding claim 129, Aqrafiotis further discloses a maximum distance for each 
data record in determining the activation or record from the other is set for 
discriminating the data records to be used in determining the activation or deactivation 
of one or more of the means for carrying out said mechanical, physical and or chemical 
actions (If cfy is larger than r max , the coordinates of the objects are updated using 

l~max 3S 

the target distance (Equation 9), [0071]). 

Regarding claim 130, Aqrafiotis further discloses the clustering or distance of the 
data records on the map onto which the database has been projected is used as a 
measure of similarity of the data records or of the variables related to the said data 
records (the algorithm attempts to match the upper bound if the current distance 
between the objects is greater than the upper bound, or the lower bound if the current 
distance between the objects is lower than the lower bound. If the distance between the 
objects lies within the upper and lower bounds, no correction is made, [0073]). 
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Regarding claim 131, Agrafiotis clearly shows and discloses an apparatus 
having artificial intelligence containing a simulation of intuitive behaviour ([0120]), 
comprising: 

a processing unit connected to each of a data memory and a program memory, 
wherein the processing unit is further connected to one or more sensors for detecting or 
measuring different physical and/or chemical conditions or effects or processes 
characterising or occurring in the environment {Figure 11 shows the computer system 
1000 includes one or more processors, such as processor 1104. The processor 1 1 04 is 
connected to a communication infrastructure 1106 (e.g., a bus or network), [01 19]. See 
further [001 1] for neurons acting as sensors); 

wherein the processing unit is further connected to data input means by means 
of a service person or a data input line from other data collecting apparati, wherein the 
processing unit is further connected to means for carrying out mechanical, physical or 
chemical actions, such as actuators or the like (Secondary memory 1110 can also 
include other similar means for allowing computer programs or input data to be loaded 
into computer system 1100. Such means may include, for example, a removable 
storage unit 1122 and an interface 1 120. Examples of such may include a program 
cartridge and cartridge interface (such as that found in video game devices), a 
removable memory chip (such as an EPROM, or PROM) and associated socket, and 
other removable storage units 1 122 and interfaces 1 120 which allow software and data 
to be transferred from the removable storage unit 1122 to computer system 1100, 
[0121]); 
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wherein the program memory stores a program executable by the processing 
unit, said program including a routine for driving the sensors and saving in a uniquely 
recognizable way each datum collected by the sensors and/or for saving data input by a 
service person or by other apparati, driver for activating or deactivating the means for 
carrying out mechanical, physical or chemical actions, such as actuators or the like 
{Computer programs (also called computer control logic) are stored in main memory 
1108 and/or secondary memory 1110. Computer programs may also be received via 
communications interface 1124. Such computer programs, when executed, enable the 
computer system 1100 to perform the features of the present invention as discussed 
herein. In particular, the computer programs, when executed, enable the processor 
11 04 to perform the features of the present invention. Accordingly, such computer 
programs represent controllers of the computer system 1100, [0124]); 

wherein the program stored in the program memory further comprises means for 
evaluating data collected by the sensors and/or inputted by a service person or by other 
apparati (Computer system 1100 may also include a communications interface 1124. 
Communications interface 1124 allows software and data to be transferred between 
computer system 1100 and external devices, 0122]); and 

wherein in operation the apparatus: 

generates from the collected and/or inputted data a database with a certain 
number of records each one comprising a certain number of variables and which are 
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relative to a N-dimensional space (a set of input patterns of dimensionality n, each with 
corresponding coordinates, [0090]); 

projects the database considering the record as points and the variables as 
coordinates or the variables as points and the records as coordinates onto a space 
having a reduced number of dimension relatively to the space N dimensional space 
(mapping of massive data sets from a higher dimensional space to a lower dimensional 
space, [0026]), said projection being carried out by means of an algorithm for projecting 
information data belonging to a multidimensional space into a space having lesser 
dimensions comprising: calculating a matrix of distances between each point defined by 
a record or a variable of the database by means of a metric function (training set T is 
extracted from the set of input patterns n, the difference in distance between a random 
set of points Xm and the set of reference points (both of which were extracted from the 
input space R" is calculated, [0096]-[0097]); 

defining a N-1 dimensional space in which each point represented by a record or 
a variable is defined by N-1 coordinates (for a linear from n to m dimensions, a simple 
3-layer network with n input and m output units can be employed. The network is trained 
to reproduce the input-output coordinates produced by the iterative algorithm, and thus 
encodes the mapping in its synaptic parameters in a compact, analytical manner, 
[0090]); 

calculating the N-1 coordinates of each point in the N-1 dimensional space by 
means of an evolutionary algorithm (coordinates of a plurality of objects on the m- 
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dimensional nonlinear map are determined by the algorithm described in [0050]-[0053]); 
and 

defining as the best projection of the points onto the N-1 dimensional space the 
projection in which the distance matrix of the points in the n-1 dimensional space best 
fits or has minimum differences with the distance matrix of the points calculated in the 
N-dimensional space (Note that new patterns in Ff that not in the original input set can 
also be projected into R™ in the manner shown in Figure 6. Once the system is trained, 
new patterns in R" are mapped by identifying the nearest local network and using that 
network in a feed-forward manner to perform the projection. The input for the system is 
a pattern 705 in Ff. This point is defined by its n attributes, x is xz . . . x n ). The system 
includes a dispatcher module 710, which compares the distance of the input point to the 
network centers (i.e., the reference points), and forwards the input point to one of the 
available local neural networks 701, 702, or 703. Specifically, the input pattern is sent to 
the local neural network associated with the reference point nearest to the input pattern. 
The chosen network then performs the final projection, resulting in an output point in 
FT, [0110]). 

Agrafiotis does not explicitly disclose the higher dimension is N while the lesser 
dimension N-1. 

However, Stolte discloses the higher dimension is N while the lesser dimension 
N-1 {Projection (Time, Product, Location -> Time, Location) reduces the dimensionality 
of an n-dimensional data cube to (n-1) by aggregating across a dimension. For 
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example, in Figure 15, the first projection summarizes across "Location", reducing the 3- 
dimensional cube to a 2-dimensional cube, [01 89]). 

It would have been obvious to an ordinary person skilled in the art at the time of 
the invention was made to incorporate the teachings of Stolte with the teachings of 
Agrafiotis for the purpose of providing an interactive visual exploration tool that 
facilitates exploratory analysis of databases having a hierarchical structure ([0038] of 
Stolte ). 

Regarding claim 132, Agrafiotis further discloses said database already contains 
distances between the records (Figure 1 illustrates possibilities for a single hypothetical 
pairwise relationship and distances of corresponding objects on a nonlinear map, 
[0028]). 

Regarding claim 133, Agrafiotis further discloses the evolutionary algorithm is a 
genetic algorithm (The approach employs an iterative algorithm based on subset 
refinements to nonlinearly map a small random sample which reflects the overall 
structure of the data, and then "learns" the underlying nonlinear transform using a set of 
distributed neural networks, each specializing in a particular domain of the feature 
space, [0045]). 

Regarding claim 134, Agrafiotis further discloses encoding each individual 
record or variable represented by a point having coordinates X and Y; 

defining a set of different X and Y coordinates for each point forming a first 
population of projections solution onto the lesser dimensional space, usually a two or 
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three dimensional space (reducing the dimensionality of high-dimensional data in a way 
that preserves the original relationships of the data objects, and 2) producing Cartesian 
coordinate vectors from data supplied directly in the form of similarities or proximities, 
[0012]); 

calculating a fitness score for each of the projections of said first population by 
using as a fitness function the matrix of distances between the single points in the 
original N dimensional space (pairwise relationship and the current distance of the 
corresponding objects on the nonlinear map are illustrated in Figure 1, where shaded 
areas 1 10, 1 12 and 1 14 denote allowed ranges for a given pairwise relationship, 
[0075]); 

combining the population of projections according to certain combinatorial rules 
and producing a first generation population of projections which comprises X and Y 
coordinates for the points which are a combination of the coordinates provided in two 
projections of the parent generation (Note that new patterns in Ff that not in the original 
input set can also be projected into Ff in the manner shown in Figure 6. Once the 
system is trained, new patterns in Ff are mapped by identifying the nearest local 
network and using that network in a feed-forward manner to perform the projection. The 
input for the system is a pattern 705 in Ff. This point is defined by its n attributes, x is 
x 2 , . . . x n ). The system includes a dispatcher module 710, which compares the distance 
of the input point to the network centers (i.e., the reference points), and forwards the 
input point to one of the available local neural networks 701, 702, or 703. Specifically, 
the input pattern is sent to the local neural network associated with the reference point 
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nearest to the input pattern. The chosen network then performs the final projection, 
resulting in an output point in R m , [01 1 0]); and 

calculating the fitness score of the projections of the first generation and forming 
a new generation basing on the first generation {The points (objects) in each cell of the 
grid are then subjected to a phase of local refinement (i.e. only objects from within the 
same cell are compared and refined). Preferably, the number of sampling steps in each 
cell should be proportional to the number of objects contained in that cell. This process 
is highly parallelizable. This local refinement phase is then followed by another global 
refinement phase, and the process is repeated for a prescribed number of cycles, or 
until the embedding error is minimized within a prescribed tolerance, [0086]). 

Regarding claim 135. Aqrafiotis further discloses the evolutionary algorithm is 
the GenD algorithm (employ Sammon's nonlinear mapping algorithm to project a small 
random sample of objects from a given population, and then "learn" the underlying 
nonlinear transform using a multilayer neural network trained with the standard error 
back-propagation algorithm or some other equivalent technique, [0021]). 

Regarding claim 136, Aqrafiotis further discloses a hidden point represented by 
a hidden record or a hidden variable can be defined, which corresponds to a hidden 
point on the map whose existence is only speculated, and wherein said hidden point is 
added to the parent population by giving it positional coordinates X hi and Y hi in the 
projection (multi-layer feed-forward networks trained to reproduce their inputs as 
desired outputs. They consist of an input and an output layer containing as many 
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neurons as the number of input dimensions, and a series of hidden layers having a 
smaller number of units. In the first part of the network, each sample is reorganized, 
mixed, and compressed into a compact representation encoded by the middle layer. 
This representation is then decompressed by the second part of the network to 
reproduce the original input, [001 1]). 

Regarding claim 137, Agrafiotis further discloses the calculation of the 
evolutionary algorithm is carried out in parallel with and without the hidden point and 
wherein the best fit projections obtained by the two parallel calculations are compared 
(a phase of global refinement is initially carried out, after which, the resulting nonlinear 
map is partitioned into a regular grid. The points (objects) in each cell of the grid are 
then subjected to a phase of local refinement (i.e. only objects from within the same cell 
are compared and refined). Preferably, the number of sampling steps in each cell 
should be proportional to the number of objects contained in that cell. This process is 
highly parallelizable. This local refinement phase is then followed by another global 
refinement phase, and the process is repeated for a prescribed number of cycles, or 
until the embedding error is minimized within a prescribed tolerance, [0086]). 

Regarding claim 138, Agrafiotis further discloses: 

providing a database comprising a certain number of records each one 
characterised by a certain number of variables (a set of input patterns of dimensionality 
n, each with corresponding coordinates, [0090]); 
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elaborating the database alternatively or in parallel according to: a first manner in 
which the records are considered as being points and the variables as being the 
coordinates of the points; and a second manner in which the variables are considered 
as being points and the records are the coordinates {training set Tis extracted from the 
set of input patterns n, the difference in distance between a random set of points X\. k 
and the set of reference points (both of which were extracted from the input space R" is 
calculated, [0096]-[0097]). 

Regarding claim 139, Aqrafiotis further discloses treating the database as a pre- 
processing or post-processing phase (The method uses a self-organizing principle to 
iteratively refine an initial (random or partially ordered) configuration of objects by 
analyzing only a subset of objects and their associated relationships at a time, [0047]). 

Regarding claim 140, Aqrafiotis further discloses the database is processed in a 
preventive stage by means of a Self Organising Map algorithm {The method uses a self- 
organizing principle to iteratively refine an initial (random or partially ordered) 
configuration of objects by analyzing only a subset of objects and their associated 
relationships at a time, [0047]), the clusters formed by said SOP algorithm in the 
different units being projected onto said space of lesser dimension (the reference points 
Cj are determined using a clustering algorithm described in greater detail below. In step 
325, the training set T is partitioned into c disjoint clusters based on the distance of 
each point x, from each reference point. The training phase concludes with step 335, 
[0097]). 
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Regarding claim 141 , Agrafiotis further discloses the clustering or distance of the 
points on the map onto which the database has been projected is used as a measure of 
similarity of the records or of the variables related to the said point (the algorithm 
attempts to match the upper bound if the current distance between the objects is greater 
than the upper bound, or the lower bound if the current distance between the objects is 
lower than the lower bound. If the distance between the objects lies within the upper 
and lower bounds, no correction is made, [0073]). 

Regarding claim 142. Agrafiotis further discloses: 

a database comprising a certain number of records, each record being related to 
a certain number of variables, is provided (a set of input patterns of dimensionality n, 
each with corresponding coordinates, [0090]); 

variables complementary to the variables originally provided are added to said 
database (the basic algorithm does not distinguish weak from strong relationships 
(short-range and long-range distances, respectively). One method to ensure that strong 
relationships are preserved more faithfully than weak relationships is to weight the 
coordinate update in Equation 5 (or, equivalently, the learning rate A in Equations 7 and 
8) by a scaling factor that is inversely proportional to the strength (magnitude) of the 
relationship, [0085]) and the resulting integrated database is projected onto a space of 
lesser dimension, particularly onto a two or three dimensional space (Note that new 
patterns in R" that not in the original input set can also be projected into R™ in the 
manner shown in Figure 6. Once the system is trained, new patterns in Ff are mapped 
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by identifying the nearest local network and using that network in a feed-forward 
manner to perform the projection. The input for the system is a pattern 705 in R". This 
point is defined by its n attributes, x is x 2 , . . . x n ). The system includes a dispatcher 
module 710, which compares the distance of the input point to the network centers (i.e., 
the reference points), and forwards the input point to one of the available local neural 
networks 701, 702, or 703. Specifically, the input pattern is sent to the local neural 
network associated with the reference point nearest to the input pattern. The chosen 
network then performs the final projection, resulting in an output point in R™, [01 1 0]); 
and 

the distance in the map between each variable and its complementary variable is 
used as a measure of the relevance of said variable in the database {training set T is 
extracted from the set of input patterns n, the difference in distance between a random 
set of points Xi. k and the set of reference points (both of which were extracted from the 
input space R° is calculated, [0096]-[0097]). 

14. Claims 94-95 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Aqrafiotis et al. (Pub. No. US 2002/0091655, published on July 11, 2002; hereinafter 
Agrafjgtis) in view of Stolte et al. (Pub. No. US 2004/0243593, filed on June 2, 2003; 
hereinafter Stolte ), and further in view of Granger (Pat. No. US 6,463,321, published on 
September 6, 2007). 

Regarding claim 94, Aqrafiotis , as modified by Stolte , does not discloses said 
method is used to evaluate the relevance of certain variables in determining a certain 
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pathological status of individuals, and to define prototypes of individuals relative to the 
variables of the database and their probability of developing a certain disease. 

However, Granger discloses said method is used to evaluate the relevance of 
certain variables in determining a certain pathological status of individuals, and to define 
prototypes of individuals relative to the variables of the database and their probability of 
developing a certain disease (projections of the characterizing ERP signal vector are 
generated. A projection of a vector is produced when some of the coordinates of the 
vector are eliminated. Generally, " projection" refers to the operation of transforming 
high -dimensional data into lower -dimensional spaces. An example would be the 
viewing of three-dimensional images in a two-dimensional plane (e.g., a photograph), 
[0032]. See [001 0] for diagnosing the presence of a neurological disorder (such as 
Alzheimer's Disease, depression, or schizophrenia), otherwise assessing the 
neurological condition of a patient, or characterizing the results of a treatment regimen 
used by a patient). 

It would have been obvious to an ordinary person skilled in the art at the time of 
the invention was made to incorporate the teachings of Granger with the teachings of 
Agrafiotis , as modified by Stolte, for the purpose of assessing the neurological condition 
of a patient, where the method obtains and processes relatively consistent, low noise, 
artifact-free data in a fast, inexpensive, and reliable ways ([0009] of Granger ). 

Regarding claim 95, Granger further discloses analyze the probability of an 
individual of having or developing Alzheimer's {diagnosing the presence of a 
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neurological disorder (such as Alzheimer's Disease, depression, or schizophrenia), 
otherwise assessing the neurological condition of a patient, or characterizing the results 
of a treatment regimen used by a patient, [001 0]). 

15. Claims 98, 100, are 103-106 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Aqrafiotis et al. (Pub. No. US 2002/0091655, published on July 11, 
2002; hereinafter Aqrafiotis ) in view of Stolte etal. (Pub. No. US 2004/0243593, filed on 
June 2, 2003; hereinafter Stolte ). and further in view of Schipper (Pat. No. US 
5,581,259, published on December 3, 1996). 

Regarding claim 98, Aqrafiotis further discloses said method is used to generate 
two dimensional maps of data records starting from a database comprising relative 
distances between the records. 

Aqrafiotis , as modified by Stolte , does not disclose that the mapping process is 
for geographical sites. 

However, Schipper discloses mapping process if for geographical sites {Figure 5 
is a projection of a three-dimensional region on a two-dimensional surface showing the 
projected locations of three landmarks, L1, L2 and L3, that are also shown on an old 
map as W, L2' and L3' in Figure 5, with the respective location coordinates (x1',y1',z1'), 
(x2',y2',z2') and (x3',y3',z3', [Column 25, Lines 37-43]). 

It would have been obvious to an ordinary person skilled in the art at the time of 
the invention was made to incorporate the teachings of Schipper with the teachings of 
Aqrafiotis , as modified by Stolte , for the purpose of allowing reconciliation of a more 
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accurate navigation system with a less accurate physical map or chart to provide 
increased accuracy for the combination ([Column 4, Line 65 -» Column 5, Line 4] of 
Schipper ). 

Regarding claim 100. Aqrafiotis further discloses the evolutionary algorithm is a 
genetic algorithm (The approach employs an iterative algorithm based on subset 
refinements to nonlinearly map a small random sample which reflects the overall 
structure of the data, and then "learns" the underlying nonlinear transform using a set of 
distributed neural networks, each specializing in a particular domain of the feature 
space, [0045]). 

Regarding claim 103, Aqrafiotis further discloses at least one hidden or 
hypothetical geographical site is added to the database of geographical sites of which 
neither the coordinates nor the distances are known and wherein a first and a second 
set of coordinates for said at least one geographical site are freely defined {The method 
uses a self-organizing principle to iteratively refine an initial (random or partially 
ordered) configuration of objects by analyzing only a subset of objects and their 
associated relationships at a time. The relationship data may be complete or incomplete 
(i.e. some relationships between objects may not be known), exact or inexact (i.e. some 
or all relationships may be given in terms of allowed ranges or limits), symmetric or 
asymmetric (i.e. the relationship of object A to object B may not be the same as the 
relationship of B to A) and may contain systematic or stochastic errors, [0047]). 
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Regarding claim 104, Agrafiotis further discloses the calculation of the 
evolutionary algorithm is carried out in parallel for the database with and without the 
hidden or hypothetical geographical site, and wherein the best fit set of positional 
coordinates of the totality of geographical sites obtained by said two parallel calculations 
are compared (a phase of global refinement is initially carried out, after which, the 
resulting nonlinear map is partitioned into a regular grid. The points (objects) in each 
cell of the grid are then subjected to a phase of local refinement (i.e. only objects from 
within the same cell are compared and refined). Preferably, the number of sampling 
steps in each cell should be proportional to the number of objects contained in that cell. 
This process is highly parallelizable. This local refinement phase is then followed by 
another global refinement phase, and the process is repeated for a prescribed number 
of cycles, or until the embedding error is minimized within a prescribed tolerance, 
[0086]). 

Regarding claim 105, Agrafiotis further discloses an additional pre-processing or 
post-processing phase is provided (The method uses a self-organizing principle to 
iteratively refine an initial (random or partially ordered) configuration of objects by 
analyzing only a subset of objects and their associated relationships at a time, [0047]). 

Regarding claim 106, Agrafiotis further discloses in a preventive phase the 
known distance data matrix is subjected to treatment by means of a Self Organising 
Map algorithm (The method uses a self-organizing principle to iteratively refine an initial 
(random or partially ordered) configuration of objects by analyzing only a subset of 
objects and their associated relationships at a time, [0047]), and wherein the clusters 
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formed by said SOM algorithm in the different units are then projected into said lesser 
dimensional space (the reference points a are determined using a clustering algorithm 
described in greater detail below. In step 325, the training set T is partitioned into c 
disjoint clusters based on the distance of each point x, from each reference point. The 
training phase concludes with step 335, [0097]). 

16. Claims 99, and 101-102 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Aqrafiotis et al. (Pub. No. US 2002/0091655, published on July 11, 
2002; hereinafter Aqrafiotis ) in view of Schipper (Pat. No. US 5,581,259, published on 
December 3, 1996). 

Regarding claim 99, Aqrafiotis clearly shows and discloses a method for 
generating two or three dimensional maps of data records starting from a database 
containing relative distances between the records ([0026]), comprising: 

organizing the known or measured distance values of the data records in a 
matrix form (training set T is extracted from the set of input patterns n, the difference in 
distance between a random set of points X hk and the set of reference points (both of 
which were extracted from the input space R" is calculated, [0096]-[0097]); 

defining a two or a three dimensional space in which the position of each record 
is uniquely defined by two or three coordinates (a set of input patterns of dimensionality 
n, each with corresponding coordinates, [0090]); 

determining the positional coordinates of each data records in the two or three 
dimensional space using an evolutionary algorithm (coordinates of a plurality of objects 
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on the m-dimensional nonlinear map are determined by the algorithm described in 
[0050]-[0053]); 

determining the distances of the data records one from the other using the 
calculated two or three dimensional positional coordinates of said data records one from 
the other (training set Tis extracted from the set of input patterns n, the difference in 
distance between a random set of points X hk and the set of reference points (both of 
which were extracted from the input space R" is calculated, [0096]-[0097]); 

generating a matrix of distances with distance values determined according to 
said means; and defining as the best two or three dimensional coordinates of position of 
the totality of the data records in the two or three dimensional space, the two or three 
dimensional coordinates of position of the said data records for which the distance 
matrix determined therefrom best fits or has a minimum difference with the distance 
matrix of the known or measured distance values of the data records {Note that new 
patterns in R" that not in the original input set can also be projected into R™ in the 
manner shown in Figure 6. Once the system is trained, new patterns in R" are mapped 
by identifying the nearest local network and using that network in a feed-forward 
manner to perform the projection. The input for the system is a pattern 705 in R". This 
point is defined by its n attributes, x is x 2 , . . . x n ). The system includes a dispatcher 
module 710, which compares the distance of the input point to the network centers (i.e., 
the reference points), and forwards the input point to one of the available local neural 
networks 701, 702, or 703. Specifically, the input pattern is sent to the local neural 
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network associated with the reference point nearest to the input pattern. The chosen 
network then performs the final projection, resulting in an output point in Ff, [0110]). 

Aqrafiotis does not disclose that the mapping process is for geographical sites. 

However, Schipper discloses mapping process if for geographical sites (Figure 5 
is a projection of a three-dimensional region on a two-dimensional surface showing the 
projected locations of three landmarks, L1, L2 and L3, that are also shown on an old 
map as W, L2' and L3' in Figure 5, with the respective location coordinates (x1',y1',zV), 
(x2',y2',z2') and (x3',y3',z3\ [Column 25, Lines 37-43]). 

It would have been obvious to an ordinary person skilled in the art at the time of 
the invention was made to incorporate the teachings of Schipper with the teachings of 
Aqrafiotis for the purpose of allowing reconciliation of a more accurate navigation 
system with a less accurate physical map or chart to provide increased accuracy for the 
combination ([Column 4, Line 65 -> Column 5, Line 4] of Schipper ). 

Regarding claim 101, Aqrafiotis further discloses: 

a) for each geographical site a first and a second set of coordinates defining the 
position of each geographical site in the two or three dimensional space is calculated 
(for a nonlinear projection from n to m dimensions, a standard 3-layer neural network 
with n input and m output units is used. Each n-dimensional object is presented to the 
input layer, and its coordinates on the m-dimensional nonlinear map are obtained by the 
respective units in the output layer, [0021]); 
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b) a fitness score of the matrix of distances among the geographical sites 
determined by means of the first and second set of coordinates defining the position of 
each geographical site in the two or three dimensional space is calculated by using as a 
fitness function the matrix of the known or measured distances of the geographical sites 
(pairwise relationship and the current distance of the corresponding objects on the 
nonlinear map are illustrated in Figure 1, where shaded areas 110, 112 and 114 denote 
allowed ranges for a given pairwise relationship, [0075]); 

c) the first and second set of coordinates of position of each geographical site are 
combined for each geographical site according to predetermined combination rules, 
thus producing at least a new first and second set of coordinates of position for each 
geographical site (Note that new patterns in Ff that not in the original input set can also 
be projected into Ff 1 in the manner shown in Figure 6. Once the system is trained, new 
patterns in Ff are mapped by identifying the nearest local network and using that 
network in a feed-forward manner to perform the projection. The input for the system is 
a pattern 705 in Ff. This point is defined by its n attributes, x is x 2 , . . . x n ). The system 
includes a dispatcher module 710, which compares the distance of the input point to the 
network centers (i.e., the reference points), and forwards the input point to one of the 
available local neural networks 701, 702, or 703. Specifically, the input pattern is sent to 
the local neural network associated with the reference point nearest to the input pattern. 
The chosen network then performs the final projection, resulting in an output point in 
FT, [0110]); 
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d) a fitness score of the said new first and second set of coordinates of position 
according to (b) is calculated (The points (objects) in each cell of the grid are then 
subjected to a phase of local refinement (i.e. only objects from within the same cell are 
compared and refined). Preferably, the number of sampling steps in each cell should be 
proportional to the number of objects contained in that cell. This process is highly 
parallelizable. This local refinement phase is then followed by another global refinement 
phase, and the process is repeated for a prescribed number of cycles, or until the 
embedding error is minimized within a prescribed tolerance, [0086]); and 

e) said new first and second set of coordinates of position are further combined 
according to (c), and (c) through (e) are repeated until at least one new first or second 
set of positional coordinates reaches a maximum fitness score or is greater than a 
minimum predefined fitness score (If dij is larger than r max , the coordinates of the objects 
are updated using r max as the target distance (Equation 9), [0071]). 

Regarding claim 102, Agrafiotis further discloses for each combination of at least 
one first and one second set of coordinates several new sets of coordinates are 
obtained by the combinations of the said at least first and second set of coordinates 
(Note that new patterns in R" that not in the original input set can also be projected into 
Ff in the manner shown in Figure 6. Once the system is trained, new patterns in Ff are 
mapped by identifying the nearest local network and using that network in a feed- 
forward manner to perform the projection. The input for the system is a pattern 705 in 
R". This point is defined by its n attributes, x is x 2 , . . . x n ). The system includes a 
dispatcher module 710, which compares the distance of the input point to the network 



Application/Control Number: 10/563,409 Page 52 

Art Unit: 2165 

centers (i.e., the reference points), and forwards the input point to one of the available 
local neural networks 701, 702, or 703. Specifically, the input pattern is sent to the local 
neural network associated with the reference point nearest to the input pattern. The 
chosen network then performs the final projection, resulting in an output point in R™, 
[0110]). 

17. Claims 107, and 114-115 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Aqrafiotis et al. (Pub. No. US 2002/0091655, published on July 11, 
2002; hereinafter Aqrafiotis ) in view of Stolte etal. (Pub. No. US 2004/0243593, filed on 
June 2, 2003; hereinafter Stolte ). and further in view of Blanev et al. (Pat. No. US 
5,680,331, published on October 21, 1997; hereinafter Blanev ). 

Regarding claim 107, Aqrafiotis . as modified by Stolte . does not disclose said 
method is used to represent the structure of a molecule in a three dimensional or two 
dimensional space, by indicating only the relative distances of the atoms of the 
molecule. 

However, Blanev discloses the structure of a molecule in a three dimensional or 
two dimensional space, by indicating only the relative distances of the atoms of the 
molecule (Figure 1a is a representation of a protein, namely epidermal growth factor 
(EGF), showing its alpha carbon backbone B and the side chains TYR 13 (Tyrosine), 
CYS 14 (Cysteine), LEU 15 (Leucine), ASN 16 (Asparagine), and ARG 41 (Arginine), 
CYS 42 (Cysteine), and GLN 43 (Glutamine) of its putative active site AS. EGF has a 
complicated three-dimensional configuration, of which Figure 1a is a two-dimensional 
projection. Figure 1b is another projection of the molecule, rotated 90 degrees around 
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the Y-axis which is shown for reference in each of these two figures, [Column 3, Lines 
26-37]). 

It would have been obvious to an ordinary person skilled in the art at the time of 
the invention was made to incorporate the teachings of Blanev with the teachings of 
Aqrafiotis , as modified by Stolte , for the purpose of identifying the putative active site of 
a molecule, determining the structure of the active site, and, by means of a depth-first 
recursive procedure, generating designs of molecules which mimic that site to a desired 
degree of accuracy ([Column 1 , Lines 58-64] of Blanev ). 

Regarding claim 114, Aqrafiotis further discloses a further pre-processing or 
post-processing phase is provided (The method uses a self-organizing principle to 
iteratively refine an initial (random or partially ordered) configuration of objects by 
analyzing only a subset of objects and their associated relationships at a time, [0047]). 

Regarding claim 115, Aqrafiotis further discloses in a preventive phase the 
known distance data matrix is subjected to treatment by means of a Self Organising 
Map algorithm (The method uses a self-organizing principle to iteratively refine an initial 
(random or partially ordered) configuration of objects by analyzing only a subset of 
objects and their associated relationships at a time, [0047]), the clusters formed by said 
SOM algorithm in the different units being then processed according to said method (the 
reference points c, are determined using a clustering algorithm described in greater 
detail below. In step 325, the training set T is partitioned into c disjoint clusters based on 
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the distance of each point x, from each reference point. The training phase concludes 
with step 335, [0097]). 

18. Claims 108-113, and 116 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Aqrafiotis et al. (Pub. No. US 2002/0091655, published on July 11, 
2002; hereinafter Aqrafiotis ) in view of Blanev et al. (Pat. No. US 5,680,331, published 
on October 21, 1997; hereinafter Blanev ). 

Regarding claim 108, Aqrafiotis further discloses a method for representing the 
structure of a database in a three dimensional or two dimensional space by indicating 
only the relative distances of at least part of the data records of the database relative to 
one another, comprising: 

a) organizing known or measured distance values of the data records in a matrix 
form (training set T is extracted from the set of input patterns n, the difference in 
distance between a random set of points Xj. k and the set of reference points (both of 
which were extracted from the input space R" is calculated, [0096]-[0097]); 

b) defining a two or a three dimensional space in which the position of each data 
records is uniquely defined by two or three coordinates (a set of input patterns of 
dimensionality n, each with corresponding coordinates, [0090]); 

c) determining the two or three coordinates of the position in the two or three 
dimensional space of each data record by means of an evolutionary algorithm 
(coordinates of a plurality of objects on the m-dimensional nonlinear map are 
determined by the algorithm described in [0050] -[0053]); 
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d) determining the distances of the data records one from the other by means of 
the calculated two or three dimensional coordinates of position of the said data records 
one from the other (training set Tis extracted from the set of input patterns n, the 
difference in distance between a random set of points Xj. k and the set of reference 
points (both of which were extracted from the input space R" is calculated, [0096]- 
[0097]); 

e) generating a matrix of distances with distances determined according to (d); 
and f) defining as the best two or three dimensional coordinates of position of the 
totality of the data records in the two or three dimensional space, the two or three 
dimensional coordinates of position of the said data records for which the distance 
matrix determined therefrom best fits, or has a minimum difference, with the distance 
matrix of the known or measured distance values of the data records {Note that new 
patterns in R" that not in the original input set can also be projected into R™ in the 
manner shown in Figure 6. Once the system is trained, new patterns in R" are mapped 
by identifying the nearest local network and using that network in a feed-forward 
manner to perform the projection. The input for the system is a pattern 705 in R". This 
point is defined by its n attributes, x is x 2 , . . . x n ). The system includes a dispatcher 
module 710, which compares the distance of the input point to the network centers (i.e., 
the reference points), and forwards the input point to one of the available local neural 
networks 701, 702, or 703. Specifically, the input pattern is sent to the local neural 
network associated with the reference point nearest to the input pattern. The chosen 
network then performs the final projection, resulting in an output point in R™, [01 1 0]). 
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Agrafiotis does not disclose the mapping process from three-dimensional space 
to two-dimensional space is also used for atoms and molecules. 

However, Blanev discloses the mapping process from three-dimensional space 
to two-dimensional space is also used for atoms and molecules {Figure 1a is a 
representation of a protein, namely epidermal growth factor (EGF), showing its alpha 
carbon backbone B and the side chains TYR 13 (Tyrosine), CYS 14 (Cysteine), LEU 15 
(Leucine), ASN 16 (Asparagine), and ARG 41 (Arginine), CYS 42 (Cysteine), and GLN 
43 (Glutamine) of its putative active site AS. EGF has a complicated three-dimensional 
configuration, of which Figure 1a is a two-dimensional projection. Figure 1b is another 
projection of the molecule, rotated 90 degrees around the Y-axis which is shown for 
reference in each of these two figures, [Column 3, Lines 26-37]). 

It would have been obvious to an ordinary person skilled in the art at the time of 
the invention was made to incorporate the teachings of Blanev with the teachings of 
Agrafiotis for the purpose of identifying the putative active site of a molecule, 
determining the structure of the active site, and, by means of a depth-first recursive 
procedure, generating designs of molecules which mimic that site to a desired degree of 
accuracy ([Column 1, Lines 58-64] of Blanev ). 

Regarding claim 109, Agrafiotis further discloses the evolutionary algorithm is a 
genetic algorithm (The approach employs an iterative algorithm based on subset 
refinements to nonlinearly map a small random sample which reflects the overall 
structure of the data, and then "learns" the underlying nonlinear transform using a set of 
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distributed neural networks, each specializing in a particular domain of the feature 
space, [0045]). 

Regarding claim 110, Agrafiotis further discloses: 

a) for each atom a first and a second set of coordinates defining the position of 
the atom in the two or three dimensional space is calculated {reducing the 
dimensionality of high-dimensional data in a way that preserves the original 
relationships of the data objects, and 2) producing Cartesian coordinate vectors from 
data supplied directly in the form of similarities or proximities, [0012]); 

b) a fitness score of the matrix of distances among the atoms determined by 
means of the first and second set of coordinates defining the position of each atom in 
the two or three dimensional space is calculated by using the matrix of the known or 
measured distances between the atoms as a fitness function (pairwise relationship and 
the current distance of the corresponding objects on the nonlinear map are illustrated in 
Figure 1, where shaded areas 110, 112 and 1 14 denote allowed ranges for a given 
pairwise relationship, [0075]); 

c) the first and second set of positional coordinates of each atom are combined 
according to predetermined combination rules, thus producing at least a new first and 
second set of positional coordinates for each atom (Note that new patterns in R" that 
not in the original input set can also be projected into Ff 1 in the manner shown in Figure 
6. Once the system is trained, new patterns in Ff are mapped by identifying the nearest 
local network and using that network in a feed-forward manner to perform the 
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projection. The input for the system is a pattern 705 in R". This point is defined by its n 
attributes, x is x 2 , . . . x n ). The system includes a dispatcher module 710, which 
compares the distance of the input point to the network centers (i.e., the reference 
points), and forwards the input point to one of the available local neural networks 701, 
702, or 703. Specifically, the input pattern is sent to the local neural network associated 
with the reference point nearest to the input pattern. The chosen network then performs 
the final projection, resulting in an output point in R m , [01 1 0]); 

d) a fitness score of said new first and second set of positional coordinates is 
calculated according to (b) (The points (objects) in each cell of the grid are then 
subjected to a phase of local refinement (i.e. only objects from within the same cell are 
compared and refined). Preferably, the number of sampling steps in each cell should be 
proportional to the number of objects contained in that cell. This process is highly 
parallelizable. This local refinement phase is then followed by another global refinement 
phase, and the process is repeated for a prescribed number of cycles, or until the 
embedding error is minimized within a prescribed tolerance, [0086]); 

e) said new first and second set of positional coordinates are again combined 
according to (c) (The points (objects) in each cell of the grid are then subjected to a 
phase of local refinement (i.e. only objects from within the same cell are compared and 
refined). Preferably, the number of sampling steps in each cell should be proportional to 
the number of objects contained in that cell. This process is highly parallelizable. This 
local refinement phase is then followed by another global refinement phase, and the 
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process is repeated for a prescribed number of cycles, or until the embedding error is 
minimized within a prescribed tolerance, [0086]); and 

(c) through (e) are repeated until at least one new first or second set of positional 
coordinates reaches a maximum fitness score or is greater than a minimum predefined 
fitness score (If dg is larger than 

i"maxi the coordinates of the objects are updated using 
r max as the target distance (Equation 9), [0071]). 

Regarding claim 111, Aqrafiotis further discloses for each combination of at least 
one first and one second set of coordinates several new sets of coordinates are 
obtained by combining said at least first and second set of coord inates(A/ote that new 
patterns in Ff that not in the original input set can also be projected into Ff in the 
manner shown in Figure 6. Once the system is trained, new patterns in Ff are mapped 
by identifying the nearest local network and using that network in a feed-forward 
manner to perform the projection. The input for the system is a pattern 705 in Ff. This 
point is defined by its n attributes, x is x 2 , . . . x n ). The system includes a dispatcher 
module 710, which compares the distance of the input point to the network centers (i.e., 
the reference points), and forwards the input point to one of the available local neural 
networks 701, 702, or 703. Specifically, the input pattern is sent to the local neural 
network associated with the reference point nearest to the input pattern. The chosen 
network then performs the final projection, resulting in an output point in Ff 1 , [01 10]). 

Regarding claim 112, Aqrafiotis further discloses at least one hidden or 
hypothetical atom is added to the database of geographical sites of which neither the 
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coordinates nor the distances are known and a first and a second set of coordinates for 
said at least one atom are freely defined (The general algorithm described above can 
also be applied when the pairwise similarity matrix is incomplete, i.e. when some of the 
pairwise similarities are unknown, when some of the pairwise similarities are uncertain 
or corrupt, or both of the above, [0066]). 

Regarding claim 113, Aqrafiotis further discloses the calculation of the 
evolutionary algorithm is carried out in parallel for the database provided with and 
without the at least one hidden or hypothetical atom (multi-layer feed-forward networks 
trained to reproduce their inputs as desired outputs. They consist of an input and an 
output layer containing as many neurons as the number of input dimensions, and a 
series of hidden layers having a smaller number of units. In the first part of the network, 
each sample is reorganized, mixed, and compressed into a compact representation 
encoded by the middle layer. This representation is then decompressed by the second 
part of the network to reproduce the original input, [001 1]) and the best fit set of 
coordinates of position of the totality of the atoms obtained by the two parallel 
calculations are compared (The points (objects) in each cell of the grid are then 
subjected to a phase of local refinement (i.e. only objects from within the same cell are 
compared and refined). Preferably, the number of sampling steps in each cell should be 
proportional to the number of objects contained in that cell. This process is highly 
parallelizable. This local refinement phase is then followed by another global refinement 
phase, and the process is repeated for a prescribed number of cycles, or until the 
embedding error is minimized within a prescribed tolerance, [0086]). 
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Regarding claim 116, Agrafiotis further discloses determine the presence and/or 
position of at least an unknown or hidden atom in the structure of the molecule {The 
general algorithm described above can also be applied when the pairwise relationship 
matrix is incomplete, i.e. when some of the pairwise relationships are unknown. In this 
case, a similar algorithm to the one described above can be used, with the exception 
that the algorithm iterates over pairs of objects for which the relationships are known. In 
this case, the algorithm identifies configurations in space that satisfy the known pairwise 
relationships; the unknown pairwise relationships adapt during the course of refinement 
and eventually assume values that lead to a satisfactory embedding of the known 
relationships, [0068]). 
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Conclusion 

1 9. These following prior arts made of record and not relied upon are considered 
pertinent to Applicant's disclosure: 

Faqin et al. (Pub. No. US 2004/0249831) teaches efficient similarity search and 
classification via rank aggregation. 

Lee (Pub. No. US 2004/0088341) teaches method for converting a multi- 
dimensional vector to a two-dimensional vector. 

The Examiner requests, in response to this Office action, support(s) must be 
shown for language added to any original claims on amendment and any new claims. 
That is, indicate support for newly added claim language by specifically pointing to 
page(s) and line no(s) in the specification and/or drawing figure(s). This will assist the 
Examiner in prosecuting the application. 

When responding to this office action, Applicant is advised to clearly point out the 
patentable novelty which he or she thinks the claims present, in view of the state of the 
art disclosed by the references cited or the objections made. He or she must also show 
how the amendments avoid such references or objections See 37 CFR 1.111 (c). 
Contact Information 

20. Any inquiry concerning this communication or earlier communications from the 
Examiner should be directed to Son T. Hoang whose telephone number is (571) 270- 
1752. The Examiner can normally be reached on Monday - Friday (7:00 AM - 4:00 
PM). 
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If attempts to reach the Examiner by telephone are unsuccessful, the Examiner's 
supervisor, Christian Chace can be reached on (571) 272-4190. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for published 
applications may be obtained from either Private PAIR or Public PAIR. Status 
information for unpublished applications is available through Private PAIR only. For 
more information about the PAIR system, see http://pair-direct.uspto.gov. Should you 
have questions on access to the Private PAIR system, contact the Electronic Business 
Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO 
Customer Service Representative or access to the automated information system, call 
800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/Son T Hoang/ 
Examiner, Art Unit 2165 
November 6, 2008 

IS. P.I 

Primary Examiner, Art Unit 2164 
/Christian P. Chace/ 

Supervisory Patent Examiner, Art Unit 2165 



